Dataset statistics
| Number of variables | 14 |
|---|---|
| Number of observations | 281880 |
| Missing cells | 44736 |
| Missing cells (%) | 1.1% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 30.1 MiB |
| Average record size in memory | 112.0 B |
Variable types
| Categorical | 4 |
|---|---|
| DateTime | 1 |
| Numeric | 9 |
VERSIE has constant value "1.0" | Constant |
DATUM_BESTAND has constant value "2021-09-13" | Constant |
PEILDATUM has constant value "2021-09-01" | Constant |
TYPERENDE_DIAGNOSE_CD has a high cardinality: 1770 distinct values | High cardinality |
BEHANDELEND_SPECIALISME_CD is highly correlated with AANTAL_PAT_PER_SPC | High correlation |
AANTAL_PAT_PER_ZPD is highly correlated with AANTAL_SUBTRAJECT_PER_ZPD | High correlation |
AANTAL_SUBTRAJECT_PER_ZPD is highly correlated with AANTAL_PAT_PER_ZPD | High correlation |
AANTAL_PAT_PER_DIAG is highly correlated with AANTAL_SUBTRAJECT_PER_DIAG | High correlation |
AANTAL_SUBTRAJECT_PER_DIAG is highly correlated with AANTAL_PAT_PER_DIAG | High correlation |
AANTAL_PAT_PER_SPC is highly correlated with BEHANDELEND_SPECIALISME_CD and 1 other fields | High correlation |
AANTAL_SUBTRAJECT_PER_SPC is highly correlated with AANTAL_PAT_PER_SPC | High correlation |
AANTAL_PAT_PER_ZPD is highly correlated with AANTAL_SUBTRAJECT_PER_ZPD | High correlation |
AANTAL_SUBTRAJECT_PER_ZPD is highly correlated with AANTAL_PAT_PER_ZPD | High correlation |
AANTAL_PAT_PER_DIAG is highly correlated with AANTAL_SUBTRAJECT_PER_DIAG | High correlation |
AANTAL_SUBTRAJECT_PER_DIAG is highly correlated with AANTAL_PAT_PER_DIAG | High correlation |
AANTAL_PAT_PER_SPC is highly correlated with AANTAL_SUBTRAJECT_PER_SPC | High correlation |
AANTAL_SUBTRAJECT_PER_SPC is highly correlated with AANTAL_PAT_PER_SPC | High correlation |
AANTAL_PAT_PER_ZPD is highly correlated with AANTAL_SUBTRAJECT_PER_ZPD | High correlation |
AANTAL_SUBTRAJECT_PER_ZPD is highly correlated with AANTAL_PAT_PER_ZPD | High correlation |
AANTAL_PAT_PER_DIAG is highly correlated with AANTAL_SUBTRAJECT_PER_DIAG | High correlation |
AANTAL_SUBTRAJECT_PER_DIAG is highly correlated with AANTAL_PAT_PER_DIAG | High correlation |
AANTAL_PAT_PER_SPC is highly correlated with AANTAL_SUBTRAJECT_PER_SPC | High correlation |
AANTAL_SUBTRAJECT_PER_SPC is highly correlated with AANTAL_PAT_PER_SPC | High correlation |
VERSIE is highly correlated with DATUM_BESTAND and 1 other fields | High correlation |
DATUM_BESTAND is highly correlated with VERSIE and 1 other fields | High correlation |
PEILDATUM is highly correlated with VERSIE and 1 other fields | High correlation |
JAAR is highly correlated with AANTAL_PAT_PER_SPC and 1 other fields | High correlation |
AANTAL_PAT_PER_ZPD is highly correlated with AANTAL_SUBTRAJECT_PER_ZPD | High correlation |
AANTAL_SUBTRAJECT_PER_ZPD is highly correlated with AANTAL_PAT_PER_ZPD | High correlation |
AANTAL_PAT_PER_DIAG is highly correlated with AANTAL_SUBTRAJECT_PER_DIAG | High correlation |
AANTAL_SUBTRAJECT_PER_DIAG is highly correlated with AANTAL_PAT_PER_DIAG | High correlation |
AANTAL_PAT_PER_SPC is highly correlated with JAAR and 1 other fields | High correlation |
AANTAL_SUBTRAJECT_PER_SPC is highly correlated with JAAR and 1 other fields | High correlation |
GEMIDDELDE_VERKOOPPRIJS has 44736 (15.9%) missing values | Missing |
AANTAL_SUBTRAJECT_PER_ZPD is highly skewed (γ1 = 21.2566906) | Skewed |
Reproduction
| Analysis started | 2021-09-27 23:01:53.014029 |
|---|---|
| Analysis finished | 2021-09-27 23:02:22.070318 |
| Duration | 29.06 seconds |
| Software version | pandas-profiling v3.1.0 |
| Download configuration | config.json |
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 2.2 MiB |
| 1.0 |
|---|
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Characters and Unicode
| Total characters | 0 |
|---|---|
| Distinct characters | 0 |
| Distinct categories | 0 ? |
| Distinct scripts | 0 ? |
| Distinct blocks | 0 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 1.0 |
|---|---|
| 2nd row | 1.0 |
| 3rd row | 1.0 |
| 4th row | 1.0 |
| 5th row | 1.0 |
Common Values
| Value | Count | Frequency (%) |
| 1.0 | 281880 |
Length
Pie chart
| Value | Count | Frequency (%) |
| 1.0 | 281880 |
Most occurring characters
| Value | Count | Frequency (%) |
| No values found. | ||
Most occurring categories
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per category
Most occurring scripts
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per script
Most occurring blocks
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per block
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 2.2 MiB |
| 2021-09-13 |
|---|
Length
| Max length | 10 |
|---|---|
| Median length | 10 |
| Mean length | 10 |
| Min length | 10 |
Characters and Unicode
| Total characters | 0 |
|---|---|
| Distinct characters | 0 |
| Distinct categories | 0 ? |
| Distinct scripts | 0 ? |
| Distinct blocks | 0 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 2021-09-13 |
|---|---|
| 2nd row | 2021-09-13 |
| 3rd row | 2021-09-13 |
| 4th row | 2021-09-13 |
| 5th row | 2021-09-13 |
Common Values
| Value | Count | Frequency (%) |
| 2021-09-13 | 281880 |
Length
Pie chart
| Value | Count | Frequency (%) |
| 2021-09-13 | 281880 |
Most occurring characters
| Value | Count | Frequency (%) |
| No values found. | ||
Most occurring categories
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per category
Most occurring scripts
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per script
Most occurring blocks
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per block
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 2.2 MiB |
| 2021-09-01 |
|---|
Length
| Max length | 10 |
|---|---|
| Median length | 10 |
| Mean length | 10 |
| Min length | 10 |
Characters and Unicode
| Total characters | 0 |
|---|---|
| Distinct characters | 0 |
| Distinct categories | 0 ? |
| Distinct scripts | 0 ? |
| Distinct blocks | 0 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 2021-09-01 |
|---|---|
| 2nd row | 2021-09-01 |
| 3rd row | 2021-09-01 |
| 4th row | 2021-09-01 |
| 5th row | 2021-09-01 |
Common Values
| Value | Count | Frequency (%) |
| 2021-09-01 | 281880 |
Length
Pie chart
| Value | Count | Frequency (%) |
| 2021-09-01 | 281880 |
Most occurring characters
| Value | Count | Frequency (%) |
| No values found. | ||
Most occurring categories
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per category
Most occurring scripts
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per script
Most occurring blocks
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per block
| Distinct | 10 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 2.2 MiB |
| Minimum | 2012-01-01 00:00:00 |
|---|---|
| Maximum | 2021-01-01 00:00:00 |
| Distinct | 27 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 424.6211189 |
| Minimum | 301 |
|---|---|
| Maximum | 8418 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 2.2 MiB |
Quantile statistics
| Minimum | 301 |
|---|---|
| 5-th percentile | 302 |
| Q1 | 305 |
| median | 313 |
| Q3 | 322 |
| 95-th percentile | 335 |
| Maximum | 8418 |
| Range | 8117 |
| Interquartile range (IQR) | 17 |
Descriptive statistics
| Standard deviation | 933.4165885 |
|---|---|
| Coefficient of variation (CV) | 2.198234018 |
| Kurtosis | 69.20096538 |
| Mean | 424.6211189 |
| Median Absolute Deviation (MAD) | 8 |
| Skewness | 8.431478186 |
| Sum | 119692201 |
| Variance | 871266.5277 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 305 | 39908 | |
| 313 | 36584 | |
| 303 | 32463 | |
| 330 | 22561 | 8.0% |
| 316 | 19192 | 6.8% |
| 308 | 14676 | 5.2% |
| 306 | 11727 | 4.2% |
| 324 | 11699 | 4.2% |
| 301 | 11461 | 4.1% |
| 304 | 9231 | 3.3% |
| Other values (17) | 72378 |
| Value | Count | Frequency (%) |
| 301 | 11461 | 4.1% |
| 302 | 6166 | 2.2% |
| 303 | 32463 | |
| 304 | 9231 | 3.3% |
| 305 | 39908 | |
| 306 | 11727 | 4.2% |
| 307 | 4900 | 1.7% |
| 308 | 14676 | 5.2% |
| 310 | 3165 | 1.1% |
| 313 | 36584 |
| Value | Count | Frequency (%) |
| 8418 | 3784 | 1.3% |
| 1900 | 186 | 0.1% |
| 390 | 759 | 0.3% |
| 389 | 3032 | 1.1% |
| 362 | 3977 | 1.4% |
| 361 | 1997 | 0.7% |
| 335 | 2878 | 1.0% |
| 330 | 22561 | |
| 329 | 748 | 0.3% |
| 328 | 6001 | 2.1% |
| Distinct | 1770 |
|---|---|
| Distinct (%) | 0.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 2.2 MiB |
| 101 | 1191 |
|---|---|
| 402 | 1165 |
| 403 | 1134 |
| 301 | 1125 |
| 203 | 1066 |
| Other values (1765) |
Length
| Max length | 4 |
|---|---|
| Median length | 3 |
| Mean length | 3.349155669 |
| Min length | 2 |
Characters and Unicode
| Total characters | 0 |
|---|---|
| Distinct characters | 0 |
| Distinct categories | 0 ? |
| Distinct scripts | 0 ? |
| Distinct blocks | 0 ? |
Unique
| Unique | 4 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | 2401 |
|---|---|
| 2nd row | 2401 |
| 3rd row | 2405 |
| 4th row | 2405 |
| 5th row | 2405 |
Common Values
| Value | Count | Frequency (%) |
| 101 | 1191 | 0.4% |
| 402 | 1165 | 0.4% |
| 403 | 1134 | 0.4% |
| 301 | 1125 | 0.4% |
| 203 | 1066 | 0.4% |
| 201 | 1060 | 0.4% |
| 401 | 947 | 0.3% |
| 404 | 946 | 0.3% |
| 802 | 926 | 0.3% |
| 409 | 920 | 0.3% |
| Other values (1760) | 271400 |
Length
| Value | Count | Frequency (%) |
| 101 | 1191 | 0.4% |
| 402 | 1165 | 0.4% |
| 403 | 1134 | 0.4% |
| 301 | 1125 | 0.4% |
| 203 | 1066 | 0.4% |
| 201 | 1060 | 0.4% |
| 401 | 947 | 0.3% |
| 404 | 946 | 0.3% |
| 802 | 926 | 0.3% |
| 409 | 920 | 0.3% |
| Other values (1760) | 271400 |
Most occurring characters
| Value | Count | Frequency (%) |
| No values found. | ||
Most occurring categories
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per category
Most occurring scripts
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per script
Most occurring blocks
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per block
ZORGPRODUCT_CD
Real number (ℝ≥0)
| Distinct | 5933 |
|---|---|
| Distinct (%) | 2.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 439730590.5 |
| Minimum | 10501002 |
|---|---|
| Maximum | 998418081 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 2.2 MiB |
Quantile statistics
| Minimum | 10501002 |
|---|---|
| 5-th percentile | 28999037 |
| Q1 | 99799028 |
| median | 149599019 |
| Q3 | 990004004 |
| 95-th percentile | 990516015 |
| Maximum | 998418081 |
| Range | 987917079 |
| Interquartile range (IQR) | 890204976 |
Descriptive statistics
| Standard deviation | 428873091.2 |
|---|---|
| Coefficient of variation (CV) | 0.9753087468 |
| Kurtosis | -1.732662051 |
| Mean | 439730590.5 |
| Median Absolute Deviation (MAD) | 119600013 |
| Skewness | 0.4724661679 |
| Sum | 1.239512589 × 1014 |
| Variance | 1.839321284 × 1017 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 990004009 | 2089 | 0.7% |
| 990004007 | 2049 | 0.7% |
| 990003004 | 1975 | 0.7% |
| 990004006 | 1637 | 0.6% |
| 990356076 | 1483 | 0.5% |
| 990356073 | 1359 | 0.5% |
| 990003007 | 1274 | 0.5% |
| 131999228 | 1270 | 0.5% |
| 131999164 | 1249 | 0.4% |
| 199299013 | 1188 | 0.4% |
| Other values (5923) | 266307 |
| Value | Count | Frequency (%) |
| 10501002 | 7 | |
| 10501003 | 10 | |
| 10501004 | 10 | |
| 10501005 | 10 | |
| 10501007 | 3 | < 0.1% |
| 10501008 | 10 | |
| 10501010 | 10 | |
| 10501011 | 3 | < 0.1% |
| 11101002 | 9 | |
| 11101003 | 10 |
| Value | Count | Frequency (%) |
| 998418081 | 135 | |
| 998418080 | 122 | |
| 998418079 | 35 | < 0.1% |
| 998418077 | 7 | < 0.1% |
| 998418076 | 7 | < 0.1% |
| 998418075 | 6 | < 0.1% |
| 998418074 | 186 | |
| 998418073 | 186 | |
| 998418072 | 7 | < 0.1% |
| 998418071 | 7 | < 0.1% |
AANTAL_PAT_PER_ZPD
Real number (ℝ≥0)
HIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATION| Distinct | 9326 |
|---|---|
| Distinct (%) | 3.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 503.0807365 |
| Minimum | 1 |
|---|---|
| Maximum | 163749 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 2.2 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 3 |
| median | 13 |
| Q3 | 100 |
| 95-th percentile | 1688 |
| Maximum | 163749 |
| Range | 163748 |
| Interquartile range (IQR) | 97 |
Descriptive statistics
| Standard deviation | 3142.423395 |
|---|---|
| Coefficient of variation (CV) | 6.246360012 |
| Kurtosis | 405.490553 |
| Mean | 503.0807365 |
| Median Absolute Deviation (MAD) | 12 |
| Skewness | 16.75773128 |
| Sum | 141808398 |
| Variance | 9874824.794 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 47009 | 16.7% |
| 2 | 23023 | 8.2% |
| 3 | 15075 | 5.3% |
| 4 | 11030 | 3.9% |
| 5 | 8493 | 3.0% |
| 6 | 7268 | 2.6% |
| 7 | 6025 | 2.1% |
| 8 | 5067 | 1.8% |
| 9 | 4686 | 1.7% |
| 10 | 4129 | 1.5% |
| Other values (9316) | 150075 |
| Value | Count | Frequency (%) |
| 1 | 47009 | |
| 2 | 23023 | |
| 3 | 15075 | 5.3% |
| 4 | 11030 | 3.9% |
| 5 | 8493 | 3.0% |
| 6 | 7268 | 2.6% |
| 7 | 6025 | 2.1% |
| 8 | 5067 | 1.8% |
| 9 | 4686 | 1.7% |
| 10 | 4129 | 1.5% |
| Value | Count | Frequency (%) |
| 163749 | 1 | |
| 155870 | 1 | |
| 154272 | 1 | |
| 145011 | 1 | |
| 144726 | 1 | |
| 116984 | 1 | |
| 115605 | 1 | |
| 110208 | 1 | |
| 109677 | 1 | |
| 108959 | 1 |
AANTAL_SUBTRAJECT_PER_ZPD
Real number (ℝ≥0)
HIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONSKEWED| Distinct | 9999 |
|---|---|
| Distinct (%) | 3.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 591.3416241 |
| Minimum | 1 |
|---|---|
| Maximum | 239907 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 2.2 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 3 |
| median | 14 |
| Q3 | 109 |
| 95-th percentile | 1915 |
| Maximum | 239907 |
| Range | 239906 |
| Interquartile range (IQR) | 106 |
Descriptive statistics
| Standard deviation | 4004.588702 |
|---|---|
| Coefficient of variation (CV) | 6.772039273 |
| Kurtosis | 717.701999 |
| Mean | 591.3416241 |
| Median Absolute Deviation (MAD) | 13 |
| Skewness | 21.2566906 |
| Sum | 166687377 |
| Variance | 16036730.67 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 45329 | 16.1% |
| 2 | 22637 | 8.0% |
| 3 | 14933 | 5.3% |
| 4 | 10829 | 3.8% |
| 5 | 8441 | 3.0% |
| 6 | 7234 | 2.6% |
| 7 | 5980 | 2.1% |
| 8 | 5025 | 1.8% |
| 9 | 4638 | 1.6% |
| 10 | 4104 | 1.5% |
| Other values (9989) | 152730 |
| Value | Count | Frequency (%) |
| 1 | 45329 | |
| 2 | 22637 | |
| 3 | 14933 | 5.3% |
| 4 | 10829 | 3.8% |
| 5 | 8441 | 3.0% |
| 6 | 7234 | 2.6% |
| 7 | 5980 | 2.1% |
| 8 | 5025 | 1.8% |
| 9 | 4638 | 1.6% |
| 10 | 4104 | 1.5% |
| Value | Count | Frequency (%) |
| 239907 | 1 | |
| 232484 | 1 | |
| 231390 | 1 | |
| 227658 | 1 | |
| 221521 | 1 | |
| 218623 | 1 | |
| 216424 | 1 | |
| 212709 | 1 | |
| 208634 | 1 | |
| 204748 | 1 |
AANTAL_PAT_PER_DIAG
Real number (ℝ≥0)
HIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATION| Distinct | 8203 |
|---|---|
| Distinct (%) | 2.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 7563.262108 |
| Minimum | 1 |
|---|---|
| Maximum | 226763 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 2.2 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 38 |
| Q1 | 381 |
| median | 1660 |
| Q3 | 6160 |
| 95-th percentile | 36249 |
| Maximum | 226763 |
| Range | 226762 |
| Interquartile range (IQR) | 5779 |
Descriptive statistics
| Standard deviation | 17744.97243 |
|---|---|
| Coefficient of variation (CV) | 2.346206197 |
| Kurtosis | 34.07260824 |
| Mean | 7563.262108 |
| Median Absolute Deviation (MAD) | 1516 |
| Skewness | 5.080534233 |
| Sum | 2131932323 |
| Variance | 314884046.4 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 21 | 460 | 0.2% |
| 19 | 458 | 0.2% |
| 8 | 454 | 0.2% |
| 12 | 450 | 0.2% |
| 37 | 449 | 0.2% |
| 9 | 442 | 0.2% |
| 28 | 440 | 0.2% |
| 17 | 422 | 0.1% |
| 23 | 416 | 0.1% |
| 4 | 416 | 0.1% |
| Other values (8193) | 277473 |
| Value | Count | Frequency (%) |
| 1 | 334 | |
| 2 | 363 | |
| 3 | 379 | |
| 4 | 416 | |
| 5 | 372 | |
| 6 | 390 | |
| 7 | 341 | |
| 8 | 454 | |
| 9 | 442 | |
| 10 | 318 |
| Value | Count | Frequency (%) |
| 226763 | 23 | |
| 213509 | 25 | |
| 212038 | 17 | |
| 210804 | 17 | |
| 210440 | 19 | |
| 205458 | 24 | |
| 204673 | 17 | |
| 200179 | 16 | |
| 198534 | 20 | |
| 189111 | 19 |
AANTAL_SUBTRAJECT_PER_DIAG
Real number (ℝ≥0)
HIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATION| Distinct | 9074 |
|---|---|
| Distinct (%) | 3.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 10796.6829 |
| Minimum | 1 |
|---|---|
| Maximum | 366095 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 2.2 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 48 |
| Q1 | 499 |
| median | 2276 |
| Q3 | 8777 |
| 95-th percentile | 51048 |
| Maximum | 366095 |
| Range | 366094 |
| Interquartile range (IQR) | 8278 |
Descriptive statistics
| Standard deviation | 26171.16459 |
|---|---|
| Coefficient of variation (CV) | 2.424000486 |
| Kurtosis | 37.99419026 |
| Mean | 10796.6829 |
| Median Absolute Deviation (MAD) | 2096 |
| Skewness | 5.338255862 |
| Sum | 3043368975 |
| Variance | 684929855.9 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 13 | 376 | 0.1% |
| 17 | 369 | 0.1% |
| 38 | 361 | 0.1% |
| 4 | 356 | 0.1% |
| 19 | 355 | 0.1% |
| 25 | 351 | 0.1% |
| 24 | 339 | 0.1% |
| 82 | 338 | 0.1% |
| 23 | 336 | 0.1% |
| 46 | 335 | 0.1% |
| Other values (9064) | 278364 |
| Value | Count | Frequency (%) |
| 1 | 283 | |
| 2 | 299 | |
| 3 | 327 | |
| 4 | 356 | |
| 5 | 321 | |
| 6 | 318 | |
| 7 | 314 | |
| 8 | 309 | |
| 9 | 255 | |
| 10 | 320 |
| Value | Count | Frequency (%) |
| 366095 | 23 | |
| 348460 | 25 | |
| 341708 | 19 | |
| 323800 | 20 | |
| 320849 | 24 | |
| 311873 | 17 | |
| 309713 | 17 | |
| 297717 | 17 | |
| 288416 | 16 | |
| 267042 | 19 |
AANTAL_PAT_PER_SPC
Real number (ℝ≥0)
HIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATION| Distinct | 269 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 660918.4521 |
| Minimum | 227 |
|---|---|
| Maximum | 1489503 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 2.2 MiB |
Quantile statistics
| Minimum | 227 |
|---|---|
| 5-th percentile | 42210 |
| Q1 | 246242 |
| median | 744717 |
| Q3 | 995483 |
| 95-th percentile | 1332889 |
| Maximum | 1489503 |
| Range | 1489276 |
| Interquartile range (IQR) | 749241 |
Descriptive statistics
| Standard deviation | 426367.1097 |
|---|---|
| Coefficient of variation (CV) | 0.6451130367 |
| Kurtosis | -1.186683755 |
| Mean | 660918.4521 |
| Median Absolute Deviation (MAD) | 326629 |
| Skewness | 0.02661943634 |
| Sum | 1.862996933 × 1011 |
| Variance | 1.817889122 × 1011 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 880968 | 5102 | 1.8% |
| 874290 | 4354 | 1.5% |
| 843993 | 4348 | 1.5% |
| 894414 | 4333 | 1.5% |
| 880560 | 4273 | 1.5% |
| 890713 | 4209 | 1.5% |
| 719212 | 4007 | 1.4% |
| 1084171 | 3890 | 1.4% |
| 1096221 | 3859 | 1.4% |
| 1063689 | 3851 | 1.4% |
| Other values (259) | 239654 |
| Value | Count | Frequency (%) |
| 227 | 7 | < 0.1% |
| 1531 | 121 | < 0.1% |
| 1609 | 130 | < 0.1% |
| 1923 | 131 | < 0.1% |
| 2004 | 196 | |
| 2316 | 64 | < 0.1% |
| 2497 | 173 | |
| 4340 | 81 | < 0.1% |
| 4413 | 297 | |
| 6811 | 380 |
| Value | Count | Frequency (%) |
| 1489503 | 2976 | |
| 1450623 | 3054 | |
| 1421848 | 3564 | |
| 1345233 | 3543 | |
| 1332889 | 3546 | |
| 1328827 | 3436 | |
| 1317382 | 3463 | |
| 1296723 | 1181 | 0.4% |
| 1283083 | 3577 | |
| 1262595 | 1201 | 0.4% |
AANTAL_SUBTRAJECT_PER_SPC
Real number (ℝ≥0)
HIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATION| Distinct | 269 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1058151.97 |
| Minimum | 230 |
|---|---|
| Maximum | 2634761 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 2.2 MiB |
Quantile statistics
| Minimum | 230 |
|---|---|
| 5-th percentile | 47081 |
| Q1 | 356417 |
| median | 1037070 |
| Q3 | 1729107 |
| 95-th percentile | 2488664 |
| Maximum | 2634761 |
| Range | 2634531 |
| Interquartile range (IQR) | 1372690 |
Descriptive statistics
| Standard deviation | 745271.4756 |
|---|---|
| Coefficient of variation (CV) | 0.7043142161 |
| Kurtosis | -0.9179410898 |
| Mean | 1058151.97 |
| Median Absolute Deviation (MAD) | 692037 |
| Skewness | 0.323116285 |
| Sum | 2.982718774 × 1011 |
| Variance | 5.554295723 × 1011 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1211813 | 5102 | 1.8% |
| 1281752 | 4354 | 1.5% |
| 1216292 | 4348 | 1.5% |
| 1315715 | 4333 | 1.5% |
| 1300634 | 4273 | 1.5% |
| 1327217 | 4209 | 1.5% |
| 1073477 | 4007 | 1.4% |
| 2556984 | 3890 | 1.4% |
| 2634761 | 3859 | 1.4% |
| 2488664 | 3851 | 1.4% |
| Other values (259) | 239654 |
| Value | Count | Frequency (%) |
| 230 | 7 | < 0.1% |
| 1739 | 121 | < 0.1% |
| 1862 | 130 | < 0.1% |
| 2039 | 196 | |
| 2200 | 131 | < 0.1% |
| 2356 | 64 | < 0.1% |
| 2819 | 173 | |
| 4346 | 81 | < 0.1% |
| 4424 | 297 | |
| 7390 | 380 |
| Value | Count | Frequency (%) |
| 2634761 | 3859 | |
| 2594635 | 3845 | |
| 2556984 | 3890 | |
| 2488664 | 3851 | |
| 2398590 | 3708 | |
| 2184421 | 3757 | |
| 2066343 | 3810 | |
| 2028939 | 1168 | 0.4% |
| 1985491 | 1167 | 0.4% |
| 1978552 | 3691 |
| Distinct | 3291 |
|---|---|
| Distinct (%) | 1.4% |
| Missing | 44736 |
| Missing (%) | 15.9% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3499.49206 |
| Minimum | 0 |
|---|---|
| Maximum | 287220 |
| Zeros | 2 |
| Zeros (%) | < 0.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 2.2 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 140 |
| Q1 | 460 |
| median | 1215 |
| Q3 | 4015 |
| 95-th percentile | 13275 |
| Maximum | 287220 |
| Range | 287220 |
| Interquartile range (IQR) | 3555 |
Descriptive statistics
| Standard deviation | 6546.109176 |
|---|---|
| Coefficient of variation (CV) | 1.870588378 |
| Kurtosis | 163.0344839 |
| Mean | 3499.49206 |
| Median Absolute Deviation (MAD) | 990 |
| Skewness | 7.67066718 |
| Sum | 829883545 |
| Variance | 42851545.34 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 105 | 1859 | 0.7% |
| 160 | 1856 | 0.7% |
| 110 | 1548 | 0.5% |
| 145 | 1439 | 0.5% |
| 180 | 1344 | 0.5% |
| 300 | 1278 | 0.5% |
| 190 | 1223 | 0.4% |
| 120 | 1223 | 0.4% |
| 185 | 1212 | 0.4% |
| 165 | 1211 | 0.4% |
| Other values (3281) | 222951 | |
| (Missing) | 44736 | 15.9% |
| Value | Count | Frequency (%) |
| 0 | 2 | < 0.1% |
| 70 | 226 | 0.1% |
| 75 | 76 | < 0.1% |
| 80 | 361 | 0.1% |
| 85 | 920 | |
| 90 | 568 | 0.2% |
| 95 | 686 | 0.2% |
| 100 | 916 | |
| 105 | 1859 | |
| 110 | 1548 |
| Value | Count | Frequency (%) |
| 287220 | 8 | |
| 148910 | 3 | < 0.1% |
| 142835 | 4 | |
| 122155 | 4 | |
| 116765 | 3 | < 0.1% |
| 109725 | 7 | |
| 108570 | 7 | |
| 107655 | 4 | |
| 101270 | 8 | |
| 95465 | 7 |
Spearman's ρ
The Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better in catching nonlinear monotonic correlations than Pearson's r. It's value lies between -1 and +1, -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation and 1 indicating total positive monotonic correlation.To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
Pearson's r
The Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. It's value lies between -1 and +1, -1 indicating total negative linear correlation, 0 indicating no linear correlation and 1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r.To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
Kendall's τ
Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. It's value lies between -1 and +1, -1 indicating total negative correlation, 0 indicating no correlation and 1 indicating total positive correlation.To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the discordant pairs divided by the total number of pairs.
Cramér's V (φc)
Cramér's V is an association measure for nominal random variables. The coefficient ranges from 0 to 1, with 0 indicating independence and 1 indicating perfect association. The empirical estimators used for Cramér's V have been proved to be biased, even for large samples. We use a bias-corrected measure that has been proposed by Bergsma in 2013 that can be found here.Phik (φk)
Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency and reverts to the Pearson correlation coefficient in case of a bivariate normal input distribution. There is extensive documentation available here.First rows
| VERSIE | DATUM_BESTAND | PEILDATUM | JAAR | BEHANDELEND_SPECIALISME_CD | TYPERENDE_DIAGNOSE_CD | ZORGPRODUCT_CD | AANTAL_PAT_PER_ZPD | AANTAL_SUBTRAJECT_PER_ZPD | AANTAL_PAT_PER_DIAG | AANTAL_SUBTRAJECT_PER_DIAG | AANTAL_PAT_PER_SPC | AANTAL_SUBTRAJECT_PER_SPC | GEMIDDELDE_VERKOOPPRIJS | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 1.0 | 2021-09-13 | 2021-09-01 | 2012-01-01 | 308 | 2401 | 19999009 | 4 | 4 | 18 | 18 | 74661 | 107866 | NaN |
| 1 | 1.0 | 2021-09-13 | 2021-09-01 | 2012-01-01 | 308 | 2401 | 19999010 | 10 | 10 | 18 | 18 | 74661 | 107866 | NaN |
| 2 | 1.0 | 2021-09-13 | 2021-09-01 | 2012-01-01 | 308 | 2405 | 19999014 | 2 | 2 | 53 | 53 | 74661 | 107866 | NaN |
| 3 | 1.0 | 2021-09-13 | 2021-09-01 | 2012-01-01 | 308 | 2405 | 19999016 | 1 | 1 | 53 | 53 | 74661 | 107866 | NaN |
| 4 | 1.0 | 2021-09-13 | 2021-09-01 | 2012-01-01 | 308 | 2405 | 19999017 | 1 | 1 | 53 | 53 | 74661 | 107866 | NaN |
| 5 | 1.0 | 2021-09-13 | 2021-09-01 | 2012-01-01 | 308 | 2405 | 19999020 | 9 | 9 | 53 | 53 | 74661 | 107866 | NaN |
| 6 | 1.0 | 2021-09-13 | 2021-09-01 | 2012-01-01 | 308 | 2405 | 19999021 | 40 | 40 | 53 | 53 | 74661 | 107866 | NaN |
| 7 | 1.0 | 2021-09-13 | 2021-09-01 | 2012-01-01 | 308 | 2401 | 19999025 | 4 | 4 | 18 | 18 | 74661 | 107866 | NaN |
| 8 | 1.0 | 2021-09-13 | 2021-09-01 | 2017-01-01 | 307 | M13 | 20108044 | 230 | 231 | 3315 | 6410 | 690915 | 1157783 | 12925.0 |
| 9 | 1.0 | 2021-09-13 | 2021-09-01 | 2017-01-01 | 307 | M13 | 20108045 | 2 | 2 | 3315 | 6410 | 690915 | 1157783 | NaN |
Last rows
| VERSIE | DATUM_BESTAND | PEILDATUM | JAAR | BEHANDELEND_SPECIALISME_CD | TYPERENDE_DIAGNOSE_CD | ZORGPRODUCT_CD | AANTAL_PAT_PER_ZPD | AANTAL_SUBTRAJECT_PER_ZPD | AANTAL_PAT_PER_DIAG | AANTAL_SUBTRAJECT_PER_DIAG | AANTAL_PAT_PER_SPC | AANTAL_SUBTRAJECT_PER_SPC | GEMIDDELDE_VERKOOPPRIJS | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 281870 | 1.0 | 2021-09-13 | 2021-09-01 | 2018-01-01 | 316 | 3504 | 991630064 | 4 | 5 | 17 | 22 | 447081 | 761995 | NaN |
| 281871 | 1.0 | 2021-09-13 | 2021-09-01 | 2018-01-01 | 316 | 3504 | 991630065 | 10 | 11 | 17 | 22 | 447081 | 761995 | 205.0 |
| 281872 | 1.0 | 2021-09-13 | 2021-09-01 | 2018-01-01 | 316 | 3520 | 991630069 | 1 | 1 | 3385 | 5138 | 447081 | 761995 | NaN |
| 281873 | 1.0 | 2021-09-13 | 2021-09-01 | 2018-01-01 | 316 | 3518 | 991630069 | 1 | 1 | 1791 | 2592 | 447081 | 761995 | NaN |
| 281874 | 1.0 | 2021-09-13 | 2021-09-01 | 2018-01-01 | 316 | 7610 | 991630070 | 1 | 1 | 116 | 151 | 447081 | 761995 | 2660.0 |
| 281875 | 1.0 | 2021-09-13 | 2021-09-01 | 2018-01-01 | 316 | 3521 | 991630070 | 1 | 1 | 289 | 374 | 447081 | 761995 | 2660.0 |
| 281876 | 1.0 | 2021-09-13 | 2021-09-01 | 2018-01-01 | 316 | 3518 | 991630070 | 1 | 1 | 1791 | 2592 | 447081 | 761995 | 2660.0 |
| 281877 | 1.0 | 2021-09-13 | 2021-09-01 | 2018-01-01 | 316 | 3517 | 991630070 | 1 | 1 | 1085 | 1557 | 447081 | 761995 | 2660.0 |
| 281878 | 1.0 | 2021-09-13 | 2021-09-01 | 2018-01-01 | 316 | 3522 | 991630070 | 1 | 1 | 1357 | 1886 | 447081 | 761995 | 2660.0 |
| 281879 | 1.0 | 2021-09-13 | 2021-09-01 | 2018-01-01 | 316 | 3520 | 991630070 | 15 | 16 | 3385 | 5138 | 447081 | 761995 | 2660.0 |